Sequencing from compomers in the presence of false negative peaks
نویسندگان
چکیده
One of the main endeavors in today’s Life Science remains the efficient sequencing of long DNA molecules. Today, most de-novo sequencing of DNA is still performed using electrophoresis-based Sanger Sequencing, based on the Sanger concept of 1977. Methods using mass spectrometry to acquire the Sanger Sequencing data are limited by short sequencing lengths of 15–25 nt. Recently, we proposed a new method for DNA sequencing using base-specific cleavage and mass spectrometry, that appears to be a promising alternative to classical DNA sequencing approaches. This leads to the combinatorial problem of Sequencing From Compomers (SFC) and, finally, to the graph-theoretical problem of finding a walk in a subgraph of the de Bruijn graph. Simulations indicate that this method might be capable of sequencing DNA molecules with 200+ nt. But the way the Sequencing From Compomer Problem is formulated, it does not take into account the problem of false negative peaks that is common for real-world data: Even though an in silico simulation predicts a peak to be present in a mass spectrum, it is absent from the measured mass spectrum. We may evade this problem by choosing a very sensitive peak detection algorithm, minimizing the number of false negative peaks. Still, a single false negative peak is usually sufficient to prohibit reconstruction of the correct DNA sequence by SFC. Here, we show how to extend SFC as well as sequencing graphs to deal with false negative peaks. In addition, we present a branch-and-bound algorithm to find all sequences that agree with the sample mass spectra with the exception of a certain number of false negative peaks. Simulation results indicate that even in the presence of several false negative peaks, the presented method might be capable of sequencing DNA molecules of length 200 nt. Contact: Sebastian Boecker AG Genominformatik Technische Fakultaet Universitaet Bielefeld PF 100 131 33501 Bielefeld Germany [email protected] Date: September 12, 2003. Sebastian Böcker is currently supported by “Deutsche Forschungsgemeinschaft” (BO 1910/1-1) within the Computer Science Action Program.
منابع مشابه
Weighted Sequencing from Compomers: DNA de-novo sequencing from mass spectrometry data in the presence of false negative peaks
One of the main endeavors in today’s Life Science remains the efficient sequencing of long DNA molecules. Today, most de-novo sequencing of DNA is still performed using electrophoresis-based Sanger Sequencing introduced in 1977, in spite of certain restrictions of this method. Recently, we proposed a new method for DNA sequencing using base-specific cleavage and mass spectrometry, that appears ...
متن کاملRetraining over the principles and mechanisms involved in the occurrence of false results from urine drug screening tests: Adulteration and strategies to defeat
Screening tests (UDSTs) for the diagnosis of psychoactive drugs can identify drug abuse, improve workplace safety, ensure community health, and play a critical role in therapeutic drug monitoring. Nonetheless, correct interpretation of the results of these tests requires a full awareness of the principles of testing methods, drug kinetics, and various leading causes of false results. Among the ...
متن کاملمشخصه جریان- ولتاژ یک دیود تونل زنی تشدیدی تحت تابش موج الکترومغناطیسی
In this paper, current-voltage characteristic of a resonant tunneling diode under electromagnetic radiation has been calculated and compared with the results when there is no electromagnetic radiation. For calculating current -voltage characteristic, it is required to calculate the transmission coefficient of electrons from the well and barrier structures of this device. For calculating the tr...
متن کاملCutibacterium Acnes is Isolated from Air Swabs: Time to Doubt the Value of Traditional Cultures in Shoulder Surgery?
Background: Given high rates of positive Cutibacterium acnes (C. acnes) cultures in cases of both primary and revisionshoulder surgery, the ramifications of positive C. acnes cultures remain uncertain. Next generation sequencing (NGS)is a molecular tool that sequences the whole bacterial genome and is capable of identifying pathogens and the relativepercent abundance in which ...
متن کاملکاربرد آنالیز طیفی بیزی در تحلیل سریهای زمانی نورسنجی
The present paper introduces the Bayesian spectral analysis as a powerful and efficient method for spectral analysis of photometric time series. For this purpose, Bayesian spectral analysis has programmed in Matlab software for XZ Dra photometric time series which is non-uniform with large gaps and the power spectrum of this analysis has compared with the power spectrum which obtained from the ...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2003